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Diboson production (WW + WZ + ZZ) has been observed at the Tevatron in hadronic decay 
modes dominated by the WW process. This paper describes the measurement of the cross section of 
WZ and ZZ events in final states with large fr and using 6-jet identification as a tool to suppress 
WW contributions. Due to the limited energy resolution, we cannot distinguish between partially 
hadronic decays of W Z and ZZ, and we measure the sum of these processes. The number of signal 
events is extracted using a simultaneous fit to the invariant mass distribution of the two jets for 
events with two 6-jet candidates and events with fewer than two fe-jet candidates. We measure a 
cross section a(pp — > WZ, ZZ) — 5.8+3'® pb, in agreement with the standard model. 

PACS numbers: 14.80. Bn, 14.70.-e, 12.15.-y 
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Measurements of diboson production cross sections 
provide tests of the self-interactions of the gauge bosons. 
Deviations from the standard model (SM) prediction for 
the production rates could indicate new physics [TJ [5], 
specifically in hadronic final states [3]. Furthermore, 
given that hadronic final states in diboson production 
are similar to associated Higgs boson production (Higgs- 
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strahlung), pp —> VH + X (V=W, Z), the analysis tech- 
niques described in this Letter are important for Higgs 
boson searches 0]. 

Diboson production has been observed at the Tevatron 
in fully leptonic final states [SJ H] . In the case of partially 
hadronic decay modes, the CDF collaboration observed 
a signal for combined measurement of WW, WZ, and 
ZZ using an integrated luminosity of 3.5 fb -1 where the 
signal is dominated by WW [7[[S]. In this paper, we de- 
scribe a measurement where we isolate the WZ and ZZ 
signals in partially hadronic decay channels by requiring 
the presence of 6-jet candidates. We perform a fit to the 
dijet invariant mass spectrum (rrijj), splitting events into 
two non-overlapping classes: with at least two 6-jet candi- 
dates (two-tag channel), and fewer than two 6-jet candi- 
dates (no-tag channel) [9] . This ensures maximum accep- 
tance to the WZ+ZZ events, and fitting in both the two- 
tag and the no-tag channel improves our signal sensitivity 
significantly compared to using only one channel (with or 
without 6-tagging). The signatures to which we are sen- 
sitive are WZ — > £vbb and ZZ — > vvbb in the two-tag 
channel and all decays with unbalanced transverse mo- 
mentum (fir) i n the no-tag channel (WZ — > tvqq, qq'vv 
and ZZ -> vvqq) [10]. 

II. THE CDF DETECTOR 

The CDF II detector is described in detail else- 
where [llj . The detector is cylindrically symmetric 
around the proton beam axis which is oriented in the 
positive 2 direction. The polar angle, 9, is measured 
from the origin of the coordinate system at the center 
of the detector with respect to the z axis. Pseudora- 
pidity, transverse energy, and transverse momentum are 
defined as r)=— lntan(0/2), Er=Esm9, and pT=psin6, 
respectively. The central and plug calorimeters, which 
respectively cover the pseudorapidity regions of < 1.1 
and 1.1<|?7|<3.6, surround the tracking system with a 
projective tower geometry. The detector has a charged 
particle tracking system immersed in a 1.4 T magnetic 
field, aligned coaxially with the pp beams. A silicon mi- 
crostrip detector provides tracking over the radial range 
1.5 to 28 cm. A 3.1 m long open-cell drift chamber, the 
central outer tracker (COT), covers the radial range from 
40 to 137 cm and provides up to 96 measurements with 
alternating axial and ±2° stereo superlayers. The fidu- 
cial region of the silicon detector extends to |?7| ~ 2, while 
the COT provides coverage for \r]\ < 1. Muons are de- 
tected up to \r/\ < 1.0 by drift chambers located outside 
the hadronic calorimeters. 



III. DATASET AND EVENT SELECTION 

We analyze a dataset of pp collisions corresponding 
to an integrated luminosity of 5.2 fb -1 collected with the 
CDF II detector at a center-of-mass energy of 1.96 TeV. 



Events are selected via a set of triggers with fir require- 
ments. The bulk of the data is collected with a trigger 
threshold fir > 45 GeV. Other triggers have a lower fir 
requirement but also include additional requirements on 
jets in the event, or sometimes correspond to smaller ef- 
fective integrated luminosity. We measure the trigger 
efficiency using an independent Z — > fifi sample and ver- 
ify that the trigger logic used does not sculpt the shape 
of the dijet invariant mass. 

Events with large fir (fir > 50 GeV) and two or more 
jets are selected in this analysis. Jets are reconstructed 
in the calorimeter using the jetclu cone algorithm [T5] 
with a cone radius of 0.4 in (77, 0) space. The energy 
measured by the calorimeter is corrected for effects that 
distort the true jet energy [T3]. Such effects include the 
non-linear response of the calorimeter to particle energy, 
loss of energy in uninstrumcntcd regions of the detec- 
tor, energy radiated outside of the jet cone, and multiple 
proton antiproton interactions per beam crossing. The 
jets must have Et > 20 GeV and be within \t)\ < 2. 
To suppress the multi-jet background contribution, we 
require the azimuthal angle between the fir vector and 
any identified jet, A(j)(fir, jet), to be larger than 0.4 ra- 
dians [14j. The ^r-significance, as defined in [7], mea- 
sures the likelihood that the fir in the event comes from 
actual particles escaping detection as opposed to resolu- 
tion effects and is typically low when fir arises from mis- 
measurements. We require ^-significance to be larger 
than 4 (see [3 [TS]) . Beam halo events are removed by re- 
quiring the event electromagnetic fraction, defined as the 
ratio between the amount of energy measured in the elec- 
tromagnetic calorimeter and the sum of electromagnetic 
and hadronic calorimeter measurements, E^yi/ E tota \, to 
be between 0.3 and 0.85. We remove cosmic ray events 
based on timing information from the electromagnetic 
and hadronic calorimeters. 



IV. SELECTING b QUARK JETS 

To gain sensitivity to the 6-quark content of our jet 
sample, we employ a new multivariate neural network 
based tagger that provides a figure of merit to indicate 
how 6-like a jet appears to be. This tagger is unique 
in its emphasis on studying individual tracks. A more 
detailed description of this tagger may be found in [T0J . 
The tagger identifies tracks with transverse momentum 
pr > 0.4 GeV/c that have registered hits in the inner- 
most (silicon) tracking layers, and uses a track-by-track 
neural network to calculate a figure of merit for a given 
track's "&ness", i.e., the likelihood that it comes from the 
decay of a B hadron. The observables used in the track 
neural network are the transverse momentum of the track 
in the laboratory frame, the transverse momentum of the 
track with respect to the jet axis, the rapidity with re- 
spect to the jet axis and the track impact parameter with 
respect to the primary vertex and its uncertainty. The 
output of the track neural network is a numerical value 
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in the range from -1 to 1. 

Having the track foiesses, we proceed to calculate the 
jet-by-jet frnesses. We use tracks with track-by-track NN 
values greater than -0.5 in the fitting of a secondary ver- 
tex. The observables used as inputs to the jet neural 
network are the top five track frnesses in the jet cone, 
the number of tracks with positive track feness, the sig- 
nificance [T7] of the displacement of the secondary vertex 
from the _B-hadron decay in the xy plane, the invariant 
mass of the tracks used to fit the displaced vertex, the 
number of K$ candidates found in the jet, and muon in- 
formation for semileptonic B decays as described in [TH] . 
We include the number of K$ candidates found since a 
much higher fraction of b jets than non-fe jets contain Kg 
particles. The final output of the algorithm is a number 
between -1 and 1, the feness. By requiring values of feness 
closer to 1, one can select increasingly pure samples of b 
jets. The training for the track neural network as well 
as the jet-by-jet network is performed using jets matched 
to b quarks from Z — » bb events for signal and jets not 
matched to b quarks for background in a pythia ZZ 
Monte Carlo sample. 

To verify that the fe-tagger data response is reproduced 
by the Monte Carlo simulation, we use two control sam- 
ples, one dominated by Z{— > ££) + 1 jet events, and one 
dominated by ti pair events using a lepton + jets selec- 
tion. The former offers a comparison of jets that largely 
do not originate from bottom quarks, while the latter 
compares jets in a heavily fe-enhanced sample. We exam- 
ine the feness distributions in simulation and data and use 
these comparisons to derive a correction to the tagging 
efficiency and mistag rates, the rate of misidentification 
of non-fe jets as b jets, in the Monte Carlo simulation for 
the cuts on the jet feness that define our tagged selection. 
The operating point of our b tagger utilizes a tight cut 
on the highest feness jet in the event, and a looser cut on 
the second highest feness jet. We list the tagging efficien- 
cies and mistag rates for these cuts in Table [TJ Further 
details of their determination are in |16j . We correct the 
MC, as it underestimates the observed mistag rate and 
overestimates the observed efficiency. 





Data 


Scale Factor on MC 


Mistag Rate 1 st jet 
2 nd jet 


1.00 ±0.21% 
8.19 ±0.34% 


1.15 ±0.24 
1.14 ±0.05 


Tag Efficiency 1 st jet 
2 nd jet 


65.2 ± 4.0% 
62.2 ± 5.4% 


0.95 ±0.06 
0.91 ±0.08 



TABLE I: Mistag rates and efficiencies on jet feness cuts de- 
termined from comparisons of data and MC in our Z+ jet 
and ti control samples. As we order jets in feness, the 1 st jet 
is the highest feness jet in the event, and the 2 nd jet is the 2 nd 
highest feness jet in the event. The MC tends to overestimate 
the tagging efficiency and underestimate the mistag rate, and 
so we apply a correction. 



V. BACKGROUND ESTIMATION 

We define our signal sample as events in the 40 < 
rrijj < 160 GeV/c 2 region. In the calculation of the in- 
variant mass rrijj we use the two jets in the events with 
the highest feness score. The final number of events is ex- 
tracted by a simultaneous fit to the dijet invariant mass 
distribution in the two-tag and no-tag channels, as de- 
fined above. Since we apply fe-tagging and allow for two 
or more jets, ti and single t production are a significant 
background. To further suppress these backgrounds, we 
require the events to have no more than one identified 
lepton (electrons or muons), where a very loose lepton 
identification is used to increase the efficiency of this re- 
jection. In addition, the sum of the number of identified 
electrons, muons and jets with > 10 GcV must not 
exceed 4. 

After this selection, we have four major classes of back- 
grounds: 

1. Electroweak (EWK) V boson ±jet processes that 
are estimated using Monte Carlo simulations and 
cross-checked using a 7±jets data set, described be- 
low. 

2. Multi-jet events with generic QCD jet production 
which result in frp due to mis-measurements of the 
jet momenta. This background is evaluated using 
a data-driven method. 

3. Single top and top quark pair production. We es- 
timate this background using a Monte Carlo simu- 
lation. 

4. WW — > Ivjj production. This is indistinguishable 
from the signal in the non-fe-tagged region. This 
background is evaluated using a Monte Carlo sim- 
ulation. 

Monte Carlo simulations used for signal and back- 
ground estimates are performed with a combination of 
PYTHIA [TH], ALPGEN [2U] and MadGraph [2T] event 
generators interfaced with PYTHIA for parton showering. 
The geometric and kinematic acceptances are obtained 
using a GEANT-based simulation of the CDF II detec- 
tor [22]. For the comparison to data, all sample cross 
sections are normalized to the results of NLO calcula- 
tions performed with mcfm v5.4 program [23) and using 
the CTEQ6M parton distribution functions (PDFs) [21] . 

A. Multi-jet background 

Multi-jet production does not typically contain large 
intrinsic f^. The underlying assumption of how multi- 
jet background enters the analysis is that either jets are 
mis-measured, or that a charged or neutral hadron or a 
7 is lost in an uninstrumented region of the detector. We 
expect the dominant effect to be jet mis- measurement. 
Because of the high cross section of multi-jet production, 
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this can be a significant background in a fx+)ets based 
analysis. We derive both the normalization and the dijet 
mass shape of the multi-jet background from data. The 
final measure of the amount of multi-jet background will 
be determined from the extraction fit. 

The two important cuts used to reject this background 
are on the ^--significance and mm(A<f)(fx, jet)). These 
distributions are shown in Fig. [T] which also demon- 
strates our ability to model the multi-jet background. 

To estimate the remaining multi-jet background con- 
tribution, we construct a new variable, fx, to com- 
plement the traditional calorimeter-based fx- The fx 
is defined as the negative vector sum of tracks with 
Pt > 0.3 GeV/c. Tracks used in the calculation of fx 
have to pass minimal quality requirements and be within 
a ±4cr window in the direction along the beamline from 
the primary vertex. 

When comparing the azimuthal angle (<fi) between fx 
and fx, we expect the two quantities to align in the case 
of true fx (e.g., for diboson signal and electroweak back- 
grounds). The difference between these two angles is 
referred to as A4>met- Electroweak backgrounds (and 
diboson signal) will be present in all regions, but will 
dominate at low A4>met due to correctly measured fx 
from neutrinos. To determine the dijet mass shape of 
the multi-jet background, we subtract all other back- 
ground predictions obtained with Monte Carlo simula- 
tions from data, in the multi-jet enhanced region of 
A<Pmet > 1- The normalization of the template ob- 
tained this way is then corrected to account for those 
events with A<f>MET < 1- This correction introduces 
a 7% uncertainty on the normalization of the multi-jet 
background, where the uncertainty was assessed by ob- 
taining the correction factor both in data and in a multi- 
jet Monte Carlo sample. The uncertainty on the shape of 
the distribution is estimated by comparing the difference 
in dijet mass shapes for A^met > 1 and AipMET < 1 
in a control sample defined by 3 < ^/p-significance < 4. 
The resulting multi-jet background dijet mass shape and 
its uncertainties are shown in Fig. [2] and are used as a 
shape uncertainty in the fit. For the two-tag channel we 
do not have enough statistics to measure a shape, so we 
use the same shape as in the no-tag region. 



B. Electroweak Shape Systematic 

Following the method used in the fx+jets analysis 
of [7], we use a 7+jets data sample to check our mod- 
eling of the V^+jet background shape. This is motivated 
by the similarities between the two types of processes. 
While there are some differences (the W and Z bosons 
are massive, the photon is not, and unlike the W the pho- 
ton lacks charge) , these are accounted for by a weighting 
procedure described below. 

Along with differences in the physics, there are also dif- 
ferences in the detector response to 7+jets and V+jets. 
In order to have the 7+jets events emulate the V+jets 



events, the photon Ex is vectorially subtracted from fx- 
Doing this, the 7+jets becomes topologically very simi- 
lar to the Z+jets with a Z decaying to neutrinos, or a 
W+jets with a W decaying to a neutrino and a missed or 
poorly reconstructed lepton. A few other differences ex- 
ist in the selection cuts applied to 7+jets versus fx+]ets 
data, shown in Table [ITJ As with the different approach 
to fx, these cuts are designed to allow for a data sample 
dominated by 7+jets events and having adequate statis- 
tics. 



|^+jets 


7+jets 


fx > 50 GeV 
A^C^i-Jet) > 0.4 
0.3 < -jP^- < 0.85 

^total 

^-significance > 4 
jet frness cuts 


|^ + £frphoton| > 50 GeV 
A<t>{$r + Breton, jet) > 0.4 
0.3 < F EM 

7 passes standard CDF cuts 
Ai?(photon,jet) > 0.7 



TABLE II: List of differences between cuts applied to the fx+ 
jets vs. 7+jets sample. A "— " denotes a lack of cut. 



In order to account for those remaining kinematic 
differences between 7+jets and V+jets, we correct the 
7+jets dijet mass shape in data based on the differ- 
ence between 7+jets and V+jets Monte Carlo simula- 
tions. First, the ratio of the rrijj distributions from 
F+jets Monte Carlo simulation and inclusive 7+jets 
Monte Carlo simulation is obtained. This ratio describes 
the difference in the physics of 7+jets and y+jets events. 
Note that since the 7+jets data sample will be contami- 
nated with 7 + W/Z —> jets events peaking in the signal 
region, their expected contribution is subtracted from the 
7+jets distribution. Next, the V+jets / 7+jcts rrijj ra- 
tio histogram is multiplied bin-by-bin with the 7+jets 
data histogram, in effect sculpting the 7+jets data to 
look like F+jets data. Since the Monte Carlo simulated 
events enter only in the ratio, any production difference 
is taken into account while effects such as detector res- 
olution, PDF uncertainties and modeling of initial- and 
final-state radiation cancel. After we apply this correc- 
tion to the 7+jets data, there is a residual difference, 
shown in Fig. [3j between the corrected 7+jets data and 
our V^+jets simulation, and we take this difference as a 
systematic uncertainty on the shape of the I^+jets back- 
ground prediction. 

VI. SIGNAL EXTRACTION AND RESULTS 

We extract the number of signal events with a binned 
maximum likelihood fit to data using the method de- 
scribed in [25] . We supply template histograms for back- 
grounds and signals and perform a simultaneous fit in 
two channels, defined by different bness thresholds. The 
templates, and the uncertainties on their normalizations, 
are listed below: 
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FIG. 1: Left: no-tag region. Right: 2-tag region. Top row: Minimum azimuthal angular separation mm(A<f)(pT, jet)) between 
all jets with Et > 5 GeV and the missing Et, for events that pass all of the analysis cuts except for the mm(A(/>(pr , jet)) cut. 
The analysis cut is at min(A^(^r, jet)) > 0.4. Bottom row: ^-significance distribution for events that pass all of the analysis 
cuts except for the ^-significance cut. The analysis cut is at ^r-significance > 4. The highest bin is the overflow bin. 



1. EWK background (W/Z+ jets): Normalizations are 
allowed to float in the fit, unconstrained, with no 
correlation between the two tagging channels. 

2. tt and single top: The uncertainties on the theo- 
retical cross sections of these processes are 6% [55] 
and 11% [27]|28], respectively. We combine these 
two processes to a single template and treat these 
uncertainties as uncorrelated, which translates to 
an uncertainty of 5.8% on the normalization of the 
no-tag channel template, and 5.4% on the normal- 
ization of the two-tag channel template, due to the 
relative contributions of each process. 

3. Multi-jet background: We use our data-driven esti- 
mate, Gaussian constrained with an uncertainty of 
7% in the no-tag channel. Because there are very 
few events in the two-tag channel template, we as- 
sign a normalization uncertainty equal to the sta- 
tistical uncertainty (y/N /N, 11%) of the template. 
The uncertainties in the two channels are treated 



as uncorrelated. 

4. WW: We use the NLO cross section and apply a 
Gaussian constraint to the number of WW events 
centered on this value with a width equal to the 
theoretical uncertainty of 6% [53] . 

5. WZ/ZZ signal: As this is our signal, its normaliza- 
tion is allowed to float unconstrained in the fit. We 
assume that each signal process contributes propor- 
tionally to its predicted SM cross section: 3.6 pb 
for WZ and 1.5 pb for ZZ ([23 ) corrected for our 
selection's acceptance and efficiencies. 

In addition to uncertainties on the normalizations of 
each template, we consider other systematic uncertainties 
that may affect the shape of templates. Shape uncertain- 
ties have been described for the electroweak and multi-jet 
backgrounds previously. For top and diboson samples, 
we consider the impact of the jet energy scale and the 
effect that uncertainties due to the differences between 
jet &ness behavior in data and Monte Carlo simulation 
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FIG. 2: The multi-jet background dijet mass template and its 
corresponding shape uncertainties. 



may have on the templates' shapes and normalizations. 
These uncertainties are summarized in Table IIHI All of 
the above uncertainties are treated as nuisance param- 
eters and are incorporated into the fit using a Bayesian 
marginalization technique |25j . 

We choose the jet frness thresholds that define our two 
fitting channels to optimize the significance of our final 
result. The optimization for the two-tag channel points 
to a broad region where the sensitivity is maximized and 
we choose the operating point for our &ness thresholds in 
that region. The optimization favors that all remaining 
events be combined in a no-tag channel, rather than a 
single-tag channel with a low feness threshold. Figure [4] 
shows the results of the fit, and Table IV shows the num- 
ber of fitted events. 



To translate the result of our fit to the data to bounds 
or limits on the cross section of W Z/ZZ production, 
we construct Feldman- Cousins bands by analyzing the 
distribution of fitted (i.e., measured) cross sections in 
pseudo-experiments generated with a variety of scale fac- 
tors on the input signal cross section [53]. When run- 
ning pseudo-experiments, we consider the effect of ad- 
ditional systematic uncertainties that affect our accep- 
tance. These are, in order of increasing significance: jet 
energy resolution (0.7%), fx modeling (1.0%), parton 
distribution functions (2.0%), initial and final state ra- 
diation (2.4%), and luminosity and trigger efficiency un- 



certainties (6.4%). The set of input cross sections in our 
pseudo-experiments range from 0.1 to 3.0 times the stan- 
dard model value with a step size of 0.1. Fig. [5] shows 
the results of our Feldman-Cousins analysis. Based on a 
Monte Carlo simulation, the acceptance times efficiency 
for the WZ and ZZ production is 4.1%, and 4.6%, re- 
spectively. 

Our measured result, using the la bands from the 
Feldman-Cousins analysis, is cr{pp — > WZ, ZZ) = 
5.8^3 q pb, in agreement with the standard model pre- 
diction ctsm = 5.1 pb ([13]). We perform pseudo- 
experiments to calculate the probability (p-value) that 
the background-only model fluctuates up to the observed 
result (observed p-value) and up to the median expected 
s + b result (expected p-value). We observe a p-value 
of 2.7%, corresponding to a signal significance of 1.9a 
where 1.7c is expected. We set a limit on a wz zz < 
13 pb (2.6 x ctsm) with 95% C.L. The techniques used 
here, in particular the b tagging algorithm, are being in- 
tegrated in the current generation of searches for a low- 
mass Higgs boson. 
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Systematic Uncertainties channel 


WZIZZ WW tt & single t EWK Multi-jet 


Cross Section (Norm.) 

2-tag 


Unconstr. ±6% ±5.8% Unconstr. ±7% 
Unconstr. ±6% ±5.4% Unconstr. ±11% 


Jet Energy Scale 

bJ 2-tag 
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bncss cuts (up) no tag 
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TABLE III: A summary of the systematic uncertainties incorporated into the fit of the dijet mass distribution. The cross 
section normalizations of the signal and EWK templates are allowed to float in the fit, unconstrained. There are additional 
uncertainties on the shape of the EWK and Multi-jet templates, as described in the text. There is also an uncertainty on the 
shape of the diboson processes due to the jet energy scale. This shape uncertainty is correlated with the rate uncertainty shown 
here. 
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Process 


Fit iV eV ents 

(no-tag) 


Fit iV CV cnts 

(two-tag) 


EWK 
tt and single t 
Multi-jet 
WW 

wz/zz 


149900 ti™ 

898 +H 

76600 « 
2720 ±200 
1330+ 710 

lOOO _ 690 


749 ±48 
217 1|? 
76.3 ±9.0 

10,5 iu 

52 _23 



TABLE IV: Extracted number of events from the 2-channel fit 
for WZ/ZZ, with all systematic uncertainties applied. Each 
uncertainty is reported to two significant figures, and all event 
totals are reported to the precision reflected in the uncer- 
tainty. 



